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Abstract: We extend the Maximum Likelihood method used by HiRes to study cross correlations be- 
tween a catalog of candidate astrophysical sources and Ultrahigh Energy Cosmic Rays (UHECRs), to 
allow for differing source luminosities. Our approach permits individual sources to be ranked according 
to their likelihood of having emitted the correlated UHECRs. We test both old and new method by simu- 
lations for various scenarios. We conclude that there are 9 true correlation between HiRes UHECRs and 
known BLLacs, with a 6 x 10 probability of such a correlation arising by chance. 
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Introduction 

Hints of excess correlations have been reported 
between ultrahigh energy cosmic rays (UHECRs) 
and BLLacs [4, 2, 1] and between UHECRs and 
x-ray clusters [3], The analyses in refs. [4, 2, 3] 
present the number of correlated events as a func- 
tion of angular separation between UHECR and 
source, and give the associated "chance probabil- 
ity" of finding a correlation at the observed level as 
a function of angular separation, in a large number 
of simulations with no correlations. 

In order to incorporate the experimental resolu- 
tion on an event-by-event basis, ref. [1] pro- 
posed a Maximum Likelihood-type procedure and 
applied it to studying correlations between UHE- 
CRs and BLLacs. This procedure (denoted the 
HiRes procedure, below) is motivated under the 
unphysical assumption that every candidate BLLac 
source has the same apparent luminosity. Even 
if BLLacs were standard candles with respect to 
UHECR emission, the BLLacs in the catalog have 
a large range of distances which would imply an 
even larger range of apparent luminosities, so one 
does not want to rely on such an assumption. 

In the present paper we introduce a ML prescrip- 
tion which avoids the assumption of equal apparent 
source luminosity and allows the potential sources 



to be ranked according to the probability that they 
have emitted the correlated UHECRs. We test and 
compare both methods with simulations. We find 
that the HiRes method gives the correct total num- 
ber of correlated events even when the sources do 
not have equal apparent luminosities, as long as the 
numbers of events are sufficiently low and the can- 
didate sources are not too dense or clustered them- 
selves. In general, our new method performs better 
in these more challenging cases, but very occasion- 
ally can be "tricked" by some special configura- 
tion. We apply the new procedure to BLLacs. 

Maximum Likelihood methods for the 
cross-correlation problem 

In the HiRes Maximum Likelihood method [ 1 ] the 
aim is to find, among N cosmic ray events, the cor- 
rect number of events, n, that are truly correlated 
with some sources, of which the total number is 
M. Hence, there will be N — n background events 
whose arrival directions are given by a probability 
density i?(x), which is simply the detector expo- 
sure to the sky as a function of angular position, 
x. For a true event with arrival direction s, the 
observed arrival direction is displaced from s ac- 
cording to a probability distribution Qj(x, s). For 
the analysis given in [1], Qi is taken to be a 2d 
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symmetric Gaussian of width equal to the resolu- 
tion <Tj, of the ith event. (Note that, throughout, the 
parameter a we quote appearing in a 2d Gaussian 
is related to ergs, the radius containing 68% of the 
cases by £7 68 = 1.51ct.) 

The probability density of observing the zth event 
in direction X; is 



Ef Q(x hSj )R( Sj ) | jy. 



N 



and the likelihood for a set of N events is defined 
to be £(n) = YiiLi Pi{ x i)> which is maximized 
when n is the true number of correlated events. 
Since £ is a very small number, which depends on 
the number of events, it is more useful to divide £ 
by the likelihood of the null hypothesis, i.e., n = 0, 
to form the likelihood ratio lZ(n) — £(n) / £(0). 
The logarithm of this ratio is then maximized to 
obtain the number of correlated events, n. 

To generalize this method to allow for sources 
with differing luminosities we assign a number of 
correlations, rij, separately for each source, with 



n = 
to 



2jf=i n r The probability density generalizes 
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where R s = J^f R ( s j)/ M - Maximizing In K 
gives the set {rij} of M numbers, containing the 
individual apparent source luminosities. We also 
get the set {(ln7£)j} of N numbers, providing in- 
formation about how strongly correlated the in- 
dividual cosmic ray events are to the catalog of 
sources. 

A crucial difference of the generalized method 
compared to the HiRes method is that n to t = 
E n j gives an estimate of all correlations, i.e., 
both true and random correlations, whereas the 
HiRes method yields only an estimate of the num- 
ber of true correlations. A crude estimate of the 
number of true correlations for the new method is 
ntot — n ran d, where n ra nd is the average number 
of correlations obtained when cosmic rays are un- 
correlated to the data set of potential sources. 

A better measure of ntme is summarized by 
the following equations, where the superscript in 
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Figure 1: The total number of correlations vs. 
the detector resolution. Error bars (slighly sepa- 
rated horizontally for readability) contain 90% of 
the cases. Top: Random sources; Bottom: actual 
BLLac positions -30° < b < 90°. 

parentheses labels the "order" of refinement, giv- 
ing successively better approximations to n true : 
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where n ran d(N) is the average total number of 
correlations found with TV randomly generated 
events, and where / is the average fraction of true 
events that are recovered as correlated. This frac- 
tion will typically be slightly smaller than unity 
since under the assumption that true correlations 
are separated according to a Gaussian distribu- 
tion, some events are separated too far from their 
sources to be accepted as correlated events. We 
measure / by simulations. 

Testing with simulations 

In vetting the two methods with simulations, we 
test their ability to correctly reproduce the number 
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Figure 2: Correlations found by HiRes method 
with 10 true correlations, M sources and N events 
in 100 square degrees. 

of true correlations on mock data sets. This allows 
us to explore the effect of large event and source 
densities, the effect of anisotropy in the source dis- 
tribution, and the consequences of having incorrect 
event resolutions. 

We begin by testing the ability to reproduce the 
number of true correlations in the simplest case of 
dilute, random sources. A first simulation is done 
using half the sky, uniform detector exposure R(x), 
156 randomly distributed sources and 271 cosmic 
rays (the numbers relevant to the BLLac studies of 
[2] and [1]). Ten of the cosmic rays are Gaussianly 
aligned (a cosmic ray paired to a source, with angu- 
lar separation according to the probability density 
Q, taken to be a 2d Gaussian of width a), for var- 
ious event resolutions. A second simulation uses 
the actual BLLacs source positions; these are more 
clustered than the random case. As shown in figure 
1 , both methods reproduce well on average the cor- 
rect number of correlations, with similar error bars 
(which include 90% of the 10k realizations). As 
(7 increases, the dispersion in the number of found 
correlations increases rapidly. 

For samples with very large numbers of events and 
of potential sources, we expect the generalized ML 
method to perform worse than the HiRes method, 
beacause rS 2 ^ is obtained by taking the difference 
of two very large numbers, rv- Q > and n ran d- More- 
over, as the source density becomes very large 
it becomes impossible to reliably distinguish the 
"contributing" sources. The total number of true 
correlations becomes the only interesting quantity 
to calculate. Thus, only the HiRes method should 
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Figure 3: Average number of found correlations as 
a function of the factor by which a is rescaled by 
when "Gaussianly aligning" sources. 

be used for the case of very high event densities. 
However, the HiRes method also deteriorates at 
high densities, as shown in figure 2. 

If the resolution of cosmic ray events are consis- 
tently over- or underestimated in a given data set, 
the extracted correlations will be incorrect. To test 
the sensitivity of the two methods to this problem 
we repeat the first type of simulations, but rescale 
the event resolution when aligning a cosmic ray to 
a source. In figure 3 the average number of found 
correlations for the two methods are plotted as a 
function of the amount by which a is rescaled. The 
new method is far less sensitive to incorrectly esti- 
mated resolution than is the HiRes method. 

Significant spatial correlations within the data set 
of potential sources may skew the found number 
of UHECR correlations. In figure 4 we show the 
results of a simulation with clustering of poten- 
tial sources introduced by hand. The figure shows 
the mean for 10k realization of 271 cosmic ray 
events with a — 0.4, and two different scenar- 
ios for the correlation with source clustering, for 
156 candidate sources. In both cases, candidate 
sources and CRs are distributed over one hemi- 
sphere; ten clusters of candidate sources are ran- 
domly distributed on the sky, each consisting of ten 
individual candidate sources distributed around the 
cluster center according to a 2d Gaussian of width 
d degrees. The remaining 56 candidate sources 
are placed at random in the hemisphere. In the 
first case, a randomly picked candidate source in 
each cluster has one cosmic ray event Gaussianly 
aligned to it and the remaining 261 cosmic rays 
are placed at random. In the second case, ten CRs 



Maximum Likelihood method for UHECR correlations 



11 




10 








9 










y 

/ 

/ 


HiRes: case A 

HiRes: case B 




8 


/ 

/ 


n (2) : case A 






/ 


n (2) : case B 




7 

( 








) 1 2 

d [° 


3 4 


E 



Figure 4: Sensitivity to clustering in source dataset, 
for UHECRs from (A) dense or (B) sparse regions. 

are Gaussianly aligned with ten of the randomly 
placed source candidates and the remaining CRs 
are placed at random. As figure 4 demonstrates, 
the HiRes method significantly overestimates the 
true number of correlations if the sources are in 
clustered regions and underestimates it when the 
candidate sources show significant clustering but 
the UHECRs do not come from the clustered re- 
gions. By contrast the new method performs well 
in this test. 

Application to BLLacs 

The binned analysis performed in [2] on the sam- 
ple of 156 BLLacs with optical magnitude m < 18 
from the Veron 10th Catalog [5] and the 271 HiRes 
events with E > 10 19 eV showed a correlation at 
the 10 -3 level. This was subsequently corrobo- 
rated using the HiRes method [1], with n = 8.0 
found correlations; the fraction of Monte Carlo 
runs with greater likelihood than the real data, was 
found to be 2 x 10~ 4 . Using the generalized ML 
method we find the number of correlations to be 
n (2) = 9.2, with T — 6 x 10~ 5 . As shown in 
figure 1 , the HiRes method underestimates the cor- 
rect number of correlations by 0.7 on average in 
these conditions. However, the difference in these 
results is consistent with the dispersion in simula- 
tions. For a list of the BLLacs correlated to UHE- 
CRs and further analysis, see forthcoming paper. 



Conclusions 

We have introduced a generalization of the HiRes 
Maximum Likelihood method, which allows the 
most likely sources of individual events to be iden- 
tified and ranked. Using simulations we have 
tested the two Maximum Likelihood methods and 
find that they complement each other well: the 
HiRes method allows a fast way to estimate the 
number of true correlated events, while the new 
method gives the quality of correlation between in- 
dividual sources and cosmic rays rather than just 
the total number of correlated events. Furthermore, 
the new method is less sensitive to the validity of 
the estimated angular resolution, and is better when 
candidate sources are clustered (as BLLacs are). 
We conclude that both methods should be used; if 
they disagree markedly on the total number of cor- 
relations the data sets may be aberrant. Applying 
the new method to the case of BLLacs, confirms an 
excess of correlations between HiRes cosmic rays 
with E > 10 19 eV [1] and BLLacs of the Veron 
10th catalog. The excess obtained is slightly larger 
(9.2 rather than 8.0 events) than with the HiRes 
method, but the two values are compatible within 
the range of variation found in simulations. 
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